# Large Language Model
- **Josiefied Qwen3 30B A3B Abliterated V2 4bit** (mlx-community) · 194 downloads · 1 like
  A 4-bit quantized conversion of the Qwen3-30B-A3B model, suitable for text generation on the MLX framework. Tags: Large Language Model.
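Most entries in this list trade precision for memory. As a rough guide (a hypothetical helper, not code from any listed repository), weight memory scales linearly with bits per parameter:

```python
def weight_memory_gb(n_params: float, bits_per_weight: float) -> float:
    """Approximate memory needed to hold the model weights alone.

    Ignores KV cache, activations, and quantization metadata
    (scales/zero-points), which add real overhead in practice.
    """
    bytes_total = n_params * bits_per_weight / 8
    return bytes_total / 1e9  # decimal gigabytes

# A 30B-parameter model at the precisions common in this list:
for bits in (16, 8, 4):
    print(f"{bits}-bit: ~{weight_memory_gb(30e9, bits):.0f} GB")
# 16-bit: ~60 GB
# 8-bit: ~30 GB
# 4-bit: ~15 GB
```

By this estimate a 30B-parameter model drops from roughly 60 GB at bf16 to roughly 15 GB at 4-bit, which is why the 4-bit MLX conversions below fit on consumer Apple Silicon; real deployments need extra headroom for the KV cache and quantization metadata.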
- **Deepseek R1 0528 Qwen3 8B Bf16** (mlx-community, MIT) · 2,298 downloads · 1 like
  An MLX-format conversion of deepseek-ai/DeepSeek-R1-0528-Qwen3-8B, suitable for local inference on Apple devices. Tags: Large Language Model.
- **PKU DS LAB.FairyR1 32B GGUF** (DevQuasar) · 134 downloads · 1 like
  FairyR1-32B is a 32B-parameter large language model developed by PKU-DS-LAB, focused on text generation tasks. Tags: Large Language Model.
- **Qwen3 32B 4bit DWQ** (mlx-community, Apache-2.0) · 211 downloads · 1 like
  Qwen3-32B-4bit-DWQ is a 4-bit DWQ-quantized version of the Qwen3-32B model, released by mlx-community for text generation tasks. Tags: Large Language Model.
- **Qwen3 235B A22B 4bit DWQ** (mlx-community, Apache-2.0) · 70 downloads · 1 like
  Qwen3-235B-A22B-4bit-DWQ is a 4-bit quantized conversion of the Qwen3-235B-A22B-8bit model, suitable for text generation tasks. Tags: Large Language Model.
- **Avern 1.5 Mintra** (averntech, MIT) · 87 downloads · 1 like
  Qwen2.5-Coder-7B-Instruct is a 7B-parameter, instruction-tuned code generation model based on the Qwen2.5 architecture, suited to code generation and programming-assistance tasks. Tags: Large Language Model.
- **Qwen3 235B A22B Mixed 3 6bit** (mlx-community, Apache-2.0) · 100 downloads · 2 likes
  A mixed 3–6-bit quantized conversion of the Qwen/Qwen3-235B-A22B model, optimized for efficient inference on the Apple MLX framework. Tags: Large Language Model.
- **Qwen Qwen2.5 VL 72B Instruct GGUF** (bartowski, Other) · 1,336 downloads · 1 like
  A quantized version of the Qwen2.5-VL-72B-Instruct multimodal large language model, supporting image-text-to-text tasks and offering quantization levels from high precision down to low memory footprint. Tags: Image-Text-to-Text, English.
- **Qwen3 30B A3B MNN** (taobao-mnn, Apache-2.0) · 550 downloads · 1 like
  An MNN model exported from Qwen3-30B-A3B, using 4-bit quantization for efficient inference. Tags: Large Language Model, English.
- **Qwen3 30B A3B 4bit DWQ** (mlx-community, Apache-2.0) · 561 downloads · 19 likes
  A 4-bit quantized version of the Qwen3-30B-A3B model, produced with a custom DWQ scheme that distills a 6-bit quantization down to 4-bit; suitable for text generation tasks. Tags: Large Language Model.
- **Qwen3 30B A3B Gptq 8bit** (btbtyler09, Apache-2.0) · 301 downloads · 2 likes
  Qwen3 30B A3B quantized to 8-bit with the GPTQ method, suitable for efficient inference scenarios. Tags: Large Language Model, Transformers.
- **Qwen3 235B A22B 4bit** (mlx-community, Apache-2.0) · 974 downloads · 6 likes
  A 4-bit quantized version of Qwen/Qwen3-235B-A22B converted to MLX format, suitable for text generation tasks. Tags: Large Language Model.
- **Qwen3 30B A3B MLX 8bit** (lmstudio-community, Apache-2.0) · 7,759 downloads · 6 likes
  An 8-bit quantized conversion of Qwen/Qwen3-30B-A3B to MLX format, suitable for text generation tasks. Tags: Large Language Model.
- **Qwen3 30B A3B MLX 4bit** (lmstudio-community, Apache-2.0) · 4,199 downloads · 19 likes
  A 4-bit quantized conversion of Qwen/Qwen3-30B-A3B, optimized for the MLX framework and suitable for text generation tasks. Tags: Large Language Model.
- **Qwen3 8B Bf16** (mlx-community, Apache-2.0) · 1,658 downloads · 1 like
  Qwen3-8B-bf16 is an MLX-format conversion of Qwen/Qwen3-8B, supporting text generation tasks. Tags: Large Language Model.
- **Qwen3 30B A3B 8bit** (mlx-community, Apache-2.0) · 1,553 downloads · 6 likes
  An 8-bit MLX-format conversion of Qwen/Qwen3-30B-A3B that runs efficiently on Apple silicon. Tags: Large Language Model.
- **Qwen3 30B A3B 4bit** (mlx-community, Apache-2.0) · 2,394 downloads · 7 likes
  A 4-bit quantized conversion of Qwen/Qwen3-30B-A3B, suitable for efficient text generation under the MLX framework. Tags: Large Language Model.
- **Qwen3 32B MLX 4bit** (lmstudio-community, Apache-2.0) · 32.14k downloads · 3 likes
  A 4-bit quantized version of Qwen3-32B in MLX format, optimized to run efficiently on Apple Silicon devices. Tags: Large Language Model.
- **Huihui Ai.glm 4 32B 0414 Abliterated GGUF** (DevQuasar) · 623 downloads · 2 likes
  GLM-4-32B-0414-abliterated is a 32B-parameter language model based on the GLM architecture, suitable for text generation tasks. Tags: Large Language Model.
- **GLM 4 32B 0414 8bit** (mlx-community, MIT) · 222 downloads · 4 likes
  An 8-bit quantized MLX-format conversion of THUDM/GLM-4-32B-0414, supporting Chinese and English text generation. Tags: Large Language Model, Multilingual.
- **GLM 4 32B 0414 EXL3** (owentruong, Apache-2.0) · 36 downloads · 2 likes
  GLM-4-32B-0414 is a large-scale language model developed by the THUDM team on the GLM architecture, suitable for a variety of text generation tasks. Tags: Large Language Model.
- **Qwen2.5 VL 72B Instruct FP8 Dynamic** (parasail-ai, Apache-2.0) · 78 downloads · 1 like
  An FP8-quantized version of Qwen2.5-VL-72B-Instruct, supporting vision-text input and text output; optimized and released by Neural Magic. Tags: Image-to-Text, Transformers, English.
- **VL Rethinker 72B 8bit** (mlx-community, Apache-2.0) · 18 downloads · 0 likes
  A multimodal vision-language model converted from Qwen2.5-VL-72B-Instruct with 8-bit quantization, suitable for visual question-answering tasks. Tags: Image-to-Text, Transformers, English.
- **Gemma 3 27b It Qat 4bit** (mlx-community, Other) · 2,200 downloads · 12 likes
  An MLX-format conversion of Google's Gemma 3 27B IT QAT model, supporting image-to-text tasks. Tags: Image-to-Text, Transformers, Other.
- **THUDM.GLM 4 32B 0414 GGUF** (DevQuasar) · 13.15k downloads · 5 likes
  GLM-4-32B-0414 is a 32-billion-parameter language model developed by THUDM, suitable for a variety of text generation tasks. Tags: Large Language Model.
- **Vora 7B Instruct** (Hon-Wong) · 154 downloads · 12 likes
  VoRA is a 7B-parameter vision-language model focused on image-text-to-text tasks. Tags: Image-to-Text, Transformers.
- **Vora 7B Base** (Hon-Wong) · 62 downloads · 4 likes
  VoRA is a 7B-parameter vision-language model that takes image and text inputs and generates text outputs. Tags: Image-to-Text, Transformers.
- **All Hands.openhands Lm 32b V0.1 GGUF** (DevQuasar) · 5,771 downloads · 2 likes
  OpenHands LM 32B v0.1 is a 32B-parameter open-source large language model from All Hands AI, built for software-development agent tasks. Tags: Large Language Model.
- **Deepseek Ai.deepseek V3 0324 GGUF** (DevQuasar) · 2,850 downloads · 2 likes
  DeepSeek-V3-0324 is a powerful foundation model focused on delivering high-quality text generation. Tags: Large Language Model.
- **Videollama2.1 7B AV CoT** (lym0302, Apache-2.0) · 34 downloads · 0 likes
  VideoLLaMA2.1-7B-AV is a multimodal large language model for audio-visual question answering; it processes video and audio inputs to produce high-quality answers and descriptions. Tags: Video-to-Text, Transformers, English.
- **Qwq 32B NF4** (ginipick, Apache-2.0) · 150 downloads · 27 likes
  A 4-bit (NF4) quantized version of Qwen/QwQ-32B built with the BitsAndBytes library, suitable for text generation in resource-constrained environments. Tags: Large Language Model, Transformers, English.
- **Olmo2 8B SuperBPE T160k** (UW, Apache-2.0) · 28 downloads · 2 likes
  An 8-billion-parameter model featuring the SuperBPE tokenizer, which combines subword and multi-word "super" tokens and achieves about 30% higher inference efficiency than traditional BPE models. Tags: Large Language Model, Transformers, English.
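The "super token" idea behind the SuperBPE entry can be sketched with a toy merge step. This is a simplified illustration of merging tokens across whitespace, not the actual SuperBPE training algorithm; the point is that fewer tokens per text means fewer decoding steps:

```python
def merge_pair(tokens, pair):
    """Greedily merge every adjacent occurrence of `pair` into one token."""
    out, i = [], 0
    while i < len(tokens):
        if i + 1 < len(tokens) and (tokens[i], tokens[i + 1]) == pair:
            out.append(tokens[i] + " " + tokens[i + 1])  # whitespace-crossing token
            i += 2
        else:
            out.append(tokens[i])
            i += 1
    return out

text = "the capital of the republic of the north"
tokens = text.split()            # word-level stand-ins for subword tokens
merged = merge_pair(tokens, ("of", "the"))
print(len(tokens), len(merged))  # 8 6: two "of the" occurrences became single tokens
```

A real tokenizer would learn which pairs to merge from corpus frequencies and apply many such merges in sequence; the encoded sequence length, and hence the number of forward passes at generation time, shrinks with every merge that fires.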
- **Mistral Small 3.1 24b Instruct 2503 Hf** (mrfakename, Apache-2.0) · 9,416 downloads · 9 likes
  Mistral Small 3.1 Instruct 24B is an instruction-tuned large language model focused on text generation tasks. Tags: Large Language Model, Transformers.
- **Gemma 3 12b It Codeforces SFT** (qgallouedec) · 43 downloads · 5 likes
  A large language model fine-tuned from google/gemma-3-12b-it on the codeforces-cots dataset. Tags: Large Language Model, Transformers.
- **Gemma 3 12b It GGUF** (tensorblock) · 336 downloads · 1 like
  A GGUF-format quantized release of the Gemma 3 12B large language model, suitable for local deployment. Tags: Large Language Model, Transformers.
- **Qwq 32B INT8 W8A8** (ospatch, Apache-2.0) · 590 downloads · 4 likes
  An INT8 quantized version of QwQ-32B, optimized by reducing the bit-width of both weights and activations. Tags: Large Language Model, Transformers, English.
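The W8A8 scheme named in the entry above stores both weights and activations as 8-bit integers plus a floating-point scale. A minimal symmetric per-tensor quantization round trip (illustrative only, not the kernels this repository actually uses):

```python
def quantize_int8(xs):
    """Symmetric per-tensor int8 quantization: x ≈ q * scale, q in [-127, 127]."""
    scale = max(abs(x) for x in xs) / 127.0
    q = [max(-127, min(127, round(x / scale))) for x in xs]
    return q, scale

def dequantize(q, scale):
    return [v * scale for v in q]

xs = [0.02, -1.3, 0.57, 0.99, -0.4]
q, scale = quantize_int8(xs)
xr = dequantize(q, scale)
max_err = max(abs(a - b) for a, b in zip(xs, xr))
assert max_err <= scale / 2  # round-off is bounded by half a quantization step
```

Quantizing activations as well as weights (unlike weight-only GPTQ or NF4) lets the matrix multiplications themselves run in integer arithmetic, which is where most of the inference speedup comes from.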
- **Gemma 3 12b Novision** (gghfez) · 86 downloads · 2 likes
  A text-only version converted from google/gemma-3-12b-it with the vision components removed, focused on text generation tasks. Tags: Large Language Model, Transformers.
- **Gemma 3 12b It GGUF** (second-state) · 583 downloads · 1 like
  Gemma-3-12b-it is a transformer-based large language model developed by Google, focused on text generation tasks. Tags: Large Language Model.
- **Gemma 3 27b It Mlx** (stephenwalker) · 24 downloads · 1 like
  An MLX conversion of Google's Gemma 3 27B IT model, supporting image-text-to-text tasks. Tags: Image-to-Text, Transformers.
- **Qwq 32B Bnb 4bit** (onekq-ai, Apache-2.0) · 167 downloads · 2 likes
  A 4-bit quantized version of QwQ-32B using bitsandbytes, suitable for efficient inference in resource-constrained environments. Tags: Large Language Model, Transformers.